PCT 



WOEU-D INTELLECTUAL PROPERTY ORGANIZATION 
Ititemattonal Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 3 

C07H 17/00, 19/06, 15/12 
C12Q 1/68 



Al 



(11) International Publication Number: WO 84/ 03285 

(43) International Publication Date: 30 August 1984 (30.08.84) 



(21) International Application Number: P(n'/US84/00279 

(22) IntemaHonal Filing Date: 22 February 1984 (22.02.84) 



(31) Priority Application Numbers: 
(32^ Priority Dates: 
(33) Priority Countries: 



PCT/US83/00254 
468,498 



22 February 1983 (22.02.83) 
22 February 1983 (22.02.83) 

AU 
US 



(60) Parent Application or Grant 
(63) Related by Continuation 

US 468,498 (CIP) 

Filed on 22 February 1983 (22.02.83) 



(71) Applicant (for all designated States except US): MO- 
• LECULAR BIOSYSTEMS, INC. [US/US]; 1 1 180 Ro- 
seUe Street, Suite A, San Diego, CA 92121 (US). 



(72) Inventor; and 

(75) Inventor/AppUcant (for US only) : RUTH, Jeny, L. [US/ 
US]; 2433 Levante Street, Carlsbad, CA 92008 (US). 

(74) Agent: SCHWALBACH, Joseph, C; 1010 Second Ave- 
nue, Suite 1607, San Diego, CA 92101 (US). 

(81) Designated States: AU, CH (European patent), DE 
(European patent), DK, FR (European patent), GB 
(European patent), /P, NL (European patent), NO, 
SE (European patent), US. 



Published 

With international search report. 
Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt 
of amendments. 



(54) Title: DEFINED SEQUENCE SINGLE STRAND OLIGONUCLEOTIDES INCORPORATING REPORTER 
GROUPS, PROCESS FOR THE CHEMICAL SYNTHESIS THEREOF, AND NUCLEOSIDES USEFUL 
IN SUCH SYNTHESIS 

(57) Abstract 

Defined sequence single strand oligonucleotides which have a length of fewer than 200 units, incorporate modified 
nucleotide units which are p3rrimidine or purine-based, and which may have readily detectable reporter groups attached to 
substituents on the modified nucleotide units thereof. Each reporter group is attached to the base of the respective nucleo- 
tide unit at a sterically tolerant site thereon, exemplified by the C-5 position of pyrimidine-based nucleotides and the C-8 
position of purine-based nucleotides. Such oligonucleotides are useful in identification, localization, isolation and/or de- 
tection of complementary nucleic acid sequences of interest in cellular or cell-free systems. Also disclosed is a process for 
the chemical synthesis of single strand oligonucleotides having a predetermined number of modified pjoimidine and/or 
purine-based nucleotide units in predetermined sequence, to the bases of which readily detectable reporter groups are att- 
ached at sterically tolerant sites through substituents thereon, either prior to or after incorporation of such units into the ol- 
igonucleotide chain. Novel nucleosides useful in the chemical synthesis of labeled, defined sequence single strand oligonu- 
cleotides are also disclosed. 



f 







FOR THE PURPOSES OF INFORMATION ONLY 




Ctodes used to identify States party to the per on the front pages of pamp Wets publish 


cations under the PCT. 








AT 


Austria 


GA 


Gabon 


MR Mauritania 


AU 


Australia 


GB 


United Kingdom 


MW Malawi 


BB 


Batbados 


HU 


Hungary 


NL Netherlands 


BE 


Belgium 


rr 


Italy 


NO Norway 


BG 


Bulgaria 


jp 


Japan 


RO Romania 


BR 


Biaza 


KP 


Democratic People's Republic 


SD Sudan 


CF 


Central Afrit^ Republic 




of Korea 


SE Sweden 


CG 


Congo 


KR 


Republic of Korea 


SN Senegal 


CH 


Switzeiiand 


LI 


Liechtenstein 


SU Soviet Union 


CM 


Cameroon 


LK 


Sri Lanka 


TD Chad 


D£ 


Germany^ Federal Republic of 


LU 


Luxembourg - 


TG Togo 


DK 


Denmaifc 


MC 


Monaco 


US United States of America 


n 


Hnland 


MG 


Madagascar 




FR 


France 


ML 


MaU 





wo 84/03285 



^1 PCTAJS84/00279 



-1- 



De script ion 



Defined Sequence Single Strand Oligoniacleo tides 
Incorporating Reporter Groups, Process for the Chemical 
Synthesis Thereof, and Nucleosides Useful in Such Synthesis 



5 Technical Field 

The present invention relates to defined sequence 
single strand oligonucleotides having a length of fewer 
than 200 base units and which contain one or more 
nucleotide units to the base of which is attached a 

10 substituent group capable of functioning as one or more 
detectable reporter groups, or which has bound thereto 
one or more detectable reporter groups; such oligonucleo- 
tides being useful in the identification; localization 
and detection of complementary nucleic acid sequences 

15 of interest in cellular or cell-free systems. 

Background Art 

The enzymatic production of labelled, double stranded 
deoxypolynucleotides has been accomplished with prior art 
techniques involving the incorporation of radioisotopes 

20 into double-strand DNA by the nick translation protocol 
of P. Rigby et al, J. Mol. Biol. 113: 237-251 (1977), 
or the gap-filling reaction described by G, Bourguignon 
et al, J. Virol, 20: 290-306 (1976). Specifically, a nick 
is introduced via DNase, and then translated along the 

25 DNA strand using a DNA polymerase. During the nick 

translation procedure, DNA polymerase from E. coli (Pol I) 
will, in the presence of added deoxynucleoside triphos- 
phates, condense nucleotides to the 3' hydroxyl terminus 
in a single strand nick region of double- stranded DNA. 

30 Simultaneoiasly the enzyme will remove nucleotides from 
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the 5* end of the nick. .1 f one or more of the added 

■ ' - 32 

triphosphates is labelled, for examole, with a-. P- 

phosphate » the label will be incorporated into the new 

strand by Pol I. Following the gap-filling procedtire, 

5 recessed ends left after restriction fendonuclease cutting 

can be filled. in with Klenow fragments from Pol I or 

T4 DNA Polymerase - 

Both the nick translation and the gap-filling 

procedixres will yield double- stranded labelled DNA. 

10 The length of the product depends upon how much DNase I 
is added to the reaction- This is usually optimized so 
that thirty percent of the label is incorporated and 
the strands are 400 to 800 nucleotide units in length - 
The product length is unpredictably heterogeneous within 

15 the 400 to 800 unit range. In order to -conserve enzyme 
as well as labelled nucleotide, only one microgram of 
DNA is usually labelled in each reaction vessel. 

Double-stranded polynucleotides which incorporate 
pyrimidine bases modified by carbon chains, at the C-5 

20 position have been prepared enzymatically in a similar 
manner . This has been done by enzymatic elongation of 
a homopolymeric primer- temp late as reported by J. Sagi 
et al., Biochem . Biosphys . Acta. 606: 196-201 (1980), or 
by nick translation/gap filling by DNA polymerases using 

25 2 • -deoxyur idine 5 * -triphosphate covalently linked to bio- 
tin as reported by P. Langer et al, Proc . Nat . Acad. Sci. 
USA 78:6633-6637 (1981), the biotin being capable of 
acting as a recognition site for avidin. 

Enzymatic methods described by Rigby, et.al, 

30 Bourguignon, et al, and Langer, et al, result in products 
having similar physical characteristics. Such enzymatic- 
ally-prepared polynucleotides are 400-800 units in length. 



wo 84/03285 



PCTAJS84/00279 



require double- stranded polynucleotides as starting 
materials, produce double -stranded polvnucleotides 
in all caises, and do not allow labeling at pre-selected 
sites . 

5 / In addition, all enzymatic methods modify both 

strands of the polynucleotide, . and such product strands 
cannot be isolated from one another. During such pro- 
cesses, enzymes replace either all units with modified 
units or, when provided with a mixture of modified 
10 and naturally-occxirring nucleoside triphosphates » 

randomly insert modified units. Furthermore, the enzy- 
matic process described by Langer, et al . is incapable 
of producing polynucleotides incorporating such reporter 
groups as fluorescent, luminescent, or antigenic reporter 
15 groups. None of this art is capable of producing oligo- 
nucleotides of defined length, defined sequence or single- 
stranded character, either with 6r without reporter groups. 
Moreover, by the prior art methods, modified bases having 
reporter groups attached thereto cannot be incorporated 
20 in a polynucleotide at preselected sites. 

Double- stranded polynucleotides which incorporate 
adenine bases modified at the C-8 position have also 
been prepared enzymatically . This has been done by 
incorporating 8-aminohexylamino-ATP (a ribonucleotide) • 
25 onto DNA fragments, as reported by C. Vincent et al, 
Nucl . - Acids Res . 10:6787-6796 (1982). The method is. 
limited in scope, however, allowing only 3 '-end 
labeling with triphosphates of adenine ribonucleo- 
tides. No modified pyrimidine nucleosides can be 
30 incorporated. Fturthermore , as with other enzymatic 
methods, both strands of a double-stranded polynucleo- 
tide are labeled and no short (< 100 units) oligonucleo- 
t?ides of defined sequence can be produced. 
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The prior art enzymatic methods referred to above 
reqxiire the, chemical synthesis of a substituted 
nucleoside 5' -triphosphate, and demand subsequent 
enzymatic recognition and incorporation of the unnatural 
5 substrate into nicked (fbuble-stranded DNA, Such methods 
are incapable of producing polynucleotides ^of any pre- 
selected length or sequence, and t±ie polymerases and 
DNases used therein are expensive. Moreover, these 
methods are time consijming, inefficient, demand sxxbstan- 

10 tial enzymatic activity and are limited to double -stranded 
DNA. Only small amounts, i.e., micrograms, of ill-defined 
polynucleotides, usually restriction fragments, are 
produced, iand these must be tediously isolated from 
natural soinrces . Moreover, the scope of modifications 

15 obtainable in the polynucleotide product is severely 

limited, since the DNA polymerase cannot recognize or in- 
corporate potentially useful reporter groups such as 
fluorescein or dinitrophenyl . 

Attachment of one fluorescent molecule to the 3' end 

20 of long polyribonucleotide molecules (RNA) .for limited 
biological application is disclosed by J.G.J. Bauman et 
al, J. Histochem. Cytochem. 29:238 (1981). This approach 
also used very small amounts (microgram quantities) of 
RNA tediously isolated from natural sources using enzy-' 

25 matic methodology, and cannot be applied to DNA since 

both 2' and 3' hydroxyls are required therefor. The much 
greater chemical instability of SNA relative to DNA also 
minimizes the scope of application of the polyribonucleo- 
tide produced by this method, 

30 The non- enzymatic synthesis of defined sequence 

oligonucleotides incorporating nattif ally-occurring nucleic 
acid bases. has been reported or reviewed by S.A. Narang 
et: al, Meth. Enzymol 65:610 (1980), R. Letsinger, J^ Org. 
Chem. 45:2715 (1980). M. Matteucci et al. J^ Amer. Chem. 
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Soc. 103:3185 (]982), and G. Alvarado-Urbina et al. 
Science 214:270 (1981). Such synthesis usually in- 
volved chain extension by cotipling an activated 
nucleotide monomer and a free hydroxyl-bearing 
5 terminal unit of a grox/ing nucleotide chain • The 
coupling is. effected' through a phosphorus-containing 
group, as in the phosphate triester method reviewed 
f by Narang et al, or one of the phosphite triester 
methods. Of the latter, those of Letsinger et al 

10 and Alvarado-Urbina et al use phosphochloridite 

chemistry, and that of Matteucci et al uses phospho- 
amidite chemistry. 

The aforementioned chemical synthesis of oligo- 
nucleotides has incorporated only unmodified or 

15 naturally-occurring nucleic acid bases , and the end 
product there of is in short fragments and resembles 
unmodified or naturally-occurring RNA or DNA. It is 
also worthy of note that the end product of such 
synthesis does not incorporate any labels or reporter 

20 group s . 

Direct modification of homopolymeric polynucleo- 
tides has been reported in systems of polyuridylic 
acid [Bigge, et al, J. Carb. , Nucleosides , Nucleotides 
8:259 (1981)]. The reported procedure is of limited 

25 scope and is not productive of useful products. 

Treatments described cause extensive polynucleotide 
cleavage and degradation, result in product irrever.- 
sibly contaminated with metal ions, and are capable 
of modifying only cytosine residues of a DNA polynu- 

30 cleotide- The method is incapable of producing 
defined sequence oligonucleotides of specific 
lengths, cannot modify thymine- or purine bases, and 
^cannot modify at previously selected sites. 
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From the foregoing it will be apparent that, 
labelled, defined sequence polynucleotides have been 
produced heretofor only by enzymatic methods which 
have the disadvantages pointed out herein, ^particu- 
5 larly those relating to time, cost, product length, 
sequence and yield* Moreover, such methodology is 
productive of only double- stranded products. Such 
double -stranded polynucleotides can be denatured by, 
alkali or heat to cause spontaneous short-term separa- 

10 tion of strands in solution. However, the individual 
single strands cannot be physically isolated from each 
other, and removal of the denaturing conditions results 
in rapid renaturation to double- stranded form* Since 
conditions productive of hybridization are also pro- 

15 ductive of renatxaration, subjecting the denatured 

polynucleotide to hybridization conditions results in 
return of the polynucleotide to its original double- 
stranded configuration in which hybridization of either 
of the strands to a target polynucleotide is limited 

20 by competition from the other strand. 
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Modif ication of nucleosides has been undertaken, 
for example, in the synthesis of antiviral C-5 sub- 
stituted pyrimidine nucleosides disclosed by Bergstrom, 
et al, J, Amer, Chem, Soc. 98:1587-1589 (1976); Ruth, 
5 et al, Org. Chem, 43:2870-2876 (1978); and Bergstrom, 
et al, U. S. Patent Nos. 4,247,544 and 4,267,171, or 
in the synthesis of C-8 substituted adenine derivatives 
disclosed by Zappelli, et al, U. S. Patent Nos. 
4,336,188 and 4,199,498. These nucleosides, and 
10 others reported by Langer, et al, and D. Ward, European 

Patent Application No. 0063879, are not useful in ^ 

the process of the present invention. Such reported 
nucleosides are highly reactive at \mdesired sites, 
and, if used in oligonucleotide synthesis by chemical 
15 methods, v7ould result in vindesired /Side products, uncon- 
trollable synthesis, and no -desired product. Moreover, 
such reported nucleosides do not contain proper sites 
in the substituent group , cannot be modified by the 
attachment of reporter groups, .nor do they contain 
20 masked reactive functionalities. No such nucleosides 
are useful in the process of the present invention. 

No chemical synthesis of defined sequence oligo- 
nucleotides incorporating modified bases of any kind, 
either with or without reporter groups, has been 
2S disclosed in the prior art. 

There is an xirgent need for high quality labelled, 
defined sequence single strand oligonucleotides which 
do not involve hazardous and unstable radioisotopes, 
and satisfaction of this need is a principal obi act of 
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the present invention. This object is accomplished by 
a chemical,, i.e., nonenzymatic., process which provides 
a predictable, superior product in high yield. More 
particularly, the process of the present invention 
5 accomplishes the chemical incorporation into defined 
sequence oligonucleotides of nucleotides modified with 
' a wide variety of selected detectable reporter groups, 
such oligonucleotides being xiseful, for example, for 
the identification, localization, isolation and/ or 
10 quantitation of complementary sequences of interest. 

Another object of the invention is to provide 
novel nucleosides useful in the chemical synthesis of 
labelled, defined sequence single strand oligonucleo- 
tides - 

15 The process of the present invention accomplishes 

the de novo chemical synthesis o:^ labelled, defined 
sequence oligonucleotides and is superior to prior 
art enzymatic methods in a number of respects. .More 
specifically the process of the present invention makes 

20 possible the synthesis of labelled, defined sequence 
single- stranded oligonucleotides of homogeneous pre- 
dictably defined length, preferably having fewer than 
200 base xmits , in contrast to the production of 
heterogeneous tmpredic table populations of 400 to 

25 10,000 base tinits of double -stranded character produced 
by prior art enzymatic methods. 

The yield of product produced by the process of 
the present invention is of the order of hxxndreds to 
tens of thousands of micrograms, in contrast to the 

30 yield of a few micrograms provided by the prior art 

enzymatic methods. Moreover, the product oligonucleo- 
4ldes of the presient invention are single- stranded, 
rather than the double- stranded products of enzymatic 
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methods. The single strand conf igtiration of the 
product oligonucleotides avoids the competition from 
a complementary strand inherent in rehybridization of 
double-stranded "polynucleotides,. 

5 DisclosTjre of Invention 

The invention comprises a defined sequence single 
strand oligonucleotide of fewer than about 200 base 
xmits in length which comprises ,at least one nucleo- 
tide unit having attached to the base thereof at a 

10 sterically tolerant site (such as C-5 of pyrimidines 

and C-8 of purines) a substituent group which is capable 
of functioning as one or more detectable reporter 
groups, or of binding one or more detectable reporter 
groups. The invention also comprises a process for the 

15 chemical, i.e., nonenzymatic, synthesis of a defined 
sequence single strand oligonucleotide which comprises 
coupling an activated nucleotide monomer and a free 
hydroxyl -bearing terminal unit of a growing nucleotide 
chain, at least one of the monomer and terminal unit 

20 having its base modified at a sterically tolerant site 
by attachment thereto of a substituent group capable of 
fxmctioning as one or more detectable reporter groups, 
or of binding one or more detectable reporter groups. 
The invention additionally comprises novel nucleosides 

25 useful in this synthetic process. 

The oligonucleotide produced by the process of the 
present invention may include one or more pyrimidine- 
based or purine-based units which may be ribonucleotides 
or deoxyribonucleo tides, and prior to the synthesis 

30 thereof, the reporter groups are preselected, as are 
the particular nucleotide units to which the reporter 
groups are attached • 
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Best Mode of Carrying Out the Invention 

The chemical process by which the defined sequence 
single strand oligonucleotides of the present invention 
are preferably synthesized comprises covtpling an acti- 
5 vated nucleotide monomer and a free hydroxyl -bearing 
terminal tmit of a growing nucleotide chaio, at least 
one of said monomer and terminal unit having its base 
modified at a sterically tolerant site by attachment 
thereto of a stibstituent group capable of fimctioning as 

10 one or more detectable reporter groups, or of binding one 
or more detectable reporter groups. 

The SLibstituent groups of the present invention 
which are capable of binding reporter groups can be 
generally characterized as those which exhibit nucleo- 

15 philic properties. Exemplairy of such groups are those 
which contain primary or aromatic amines", carbojq^^lic 
acids, hydroxyls and the like. 

The bases of the nucleotide monomer and terminal 

20 unit are selected to provide the predetermined sequence 
of nucleotide units desired in the end product oligo- 
nucleotide. Such bases can take the form of the purines 
adenine (A) , guanine (G) , or hypoxanthine (H) , or of the 
pyrimidines uracil (U) , cytosine (C) , or thymine (T) . 

25 Such bases may also take the form of any other base which 
can be isolated from natural soturces. 

A sterically tolerant site on a nucleotide unit can 
be defined as a position on the nucleic acid base of the 
xmit at which modification of the mit can be effected 

30 by attachment thereto of the substituent group without 
causing significant interference with hybridization of 
■ the product oligonucleotide and complementary nucleic 
licid components, and without sterically preventing the 
substituent groxip from fmctioning as one or more 

35 reporter groups , or from binding one or more reporter 
groups. Sterically tolerant sites are found at the C-8 



wo 84/03285 



^ PCT/US84/00279 



-11- 

position of piirines and at the C-5 position of pyrimi- 
dines. Since the oligonucleotides of the present 
invention, are particularly useful as hybridization 
probes, modifications which attach substituent and/or 
5 reporter groups should not be at sites on the pyrimi- 
dine or purine bases which are necessary for specific 
hybridization. Sites which should not be modified 
include N"*- and of adenine bases; N"^, N^, ,and 0^ 
of guanine bases; N and N of cytosine bases. Gener- 
ic ally, substitution at any heteroatom (N or 0) should 
be avoided. 

A reporter group can be defined as a chemical group 
which may be aromatic and/or polycyclic and which has 
a physical or chemical characteristic which can be 

15 readily measured or detected by appropriate physical 
or chemical detector systems or procedures. Reporter 
groups which are useful in oligonucleotides of the pre- 
sent invention are readily detectable- Ready detect- 
ability may be provided by such characteristics as 

20 color change, luminescence, fluorescence, or radio- 
activity; or it may be provided by the ability of the 
reporter group to serve as a ligand recognition site. 
Such groups are termed fxmctionally colorimetric, 
Itominescent , fluorescent, radioactive or ligand recog- 

25 nition groups. Among such groups are those suitable 

for ready detection by conventional detection techniques, 
such as colorimetric, spectrophotometric, fluorometric 
. or radioactive detection, as well as those which are 
capable of participating in the formation of specific 

30 ligand-ligand complexes which contain groups detectable 
by such conventional detection procedures. 

A reporter group as used herein is defined as a 
substituent group which has physical, chemical or other 
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characteristics which can be readily measxired or de- 
tected by the use of appropriate meastarement or detec- 
tion procedtires. A reporter group as defined herein 
also includes a substituent group which is capable of 
5 initiating one or more.. interactions which ultimately 

provide a reaction product or a complex having physical, 
chemical or other characteristics which can be readily 
measxired or detected by the use of appropriate meastare- 
ment or detection procedures. 

10 Exemplary of the measxorable or detectable charac- - 

teristics which such groups, reaction products or com- 
plexes may exhibit or induce are a color change, 
limiinescence, fluorescence or radioactivity. Such 
characteristics can be measured or detected by the use 

15 of conventional color ime trie , spectrophotometric, 

fluorometric or radioactivity sensing instrumentation. 

The interactions which useftilly can be initiated 
by the reporter group defined herein include appropri- 
ately specific and selective ligand-ligand interactions 

20 productive of groups or complexes which are readily 

detectable, for exanqple, by color imetric , spectrophoto- 
metric, fluorometric, or radioactive detection proce- 
dxores. Such interactions may take the form of protein- 
ligand, enzyme- substrate, antibody- antigen, carbohydrate 

25 lectin, protein-cof actor , protein-effector, nucleic 

acid- nucleic acid or nucleic acid-ligand interactions. 
Exemplary of- such ligand-ligand interactions are 
dinitrophenyl-dinitrophenyl antibody, biotin-avidin, 
oligonucleotide-complementary oligonucleotide, DNA-DNA, 

30 RNA-DNA and NADH-dehydrogenase. Other useful interac- 
tions will suggest themselves to those skilled in the 
^ar t . 
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In the process of the present invention, a selected 
reporter group or groups can optionally be attached to 
the nucleotide monomer before coupling of the monomer 
to the terminal omit of the nucleotide chain, or it 
5 can be attached to the product oligonucleotide after 

formation thereof. The sequence of the nucleotide units 
in the product oligonucleotide is preselected to provide 
such oligonucleotide with precise specificity for its 
ultimate use. The oligonucleotides of the present 
10 invention are useful tools in recombinant DNA and other 
protocols involving nucleic acid rehybridization tech- 
niques. Among such uses are identification, localiza- 
tion, isolation and/or quantitation of complementary 
sequences of interest in cellular or cell-free systems. 
15 More specifically, such uses may include diagnostic 
applications or any fundamental biological event in- 
volving. hybridization of nucleic acid components , or 
purification of complementary sequences by affinity 
chromatography when the product oligonucleotide is 
20 attached to a solid support through the modifications 
at a sterically tolerant site, with or without sub- 
sequent detection. 

The nucleotide units in the product oligonucleotide 
may be purine or pyrimidine-based and may comprise units 
25 having naturally-occiirring bases intermixed with xinits 
having modified bases. Such units may be ribonucleo- 
tides or deoxyribonucleotides. The coupling step pre- 
ferably involves coupling of a monomer mit activated 
at the 3' position with a free 5' hydroxyl of the ter- 
30 minal unit of the growing nucleotide chain. Alterna- 
tively such coupling can involve coupling of a monomer 
unit activated at the 5' position with a free 3* hydroxyl 
of the terminal unit of the nucleotide chain. The 



OMPI 
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teanninal unit may be the initial or only unit in the 
growing nucleotide chain at the time of coupling 
thereto pf the nucleotide monomer, or it may be the 
terminal one of a plurality of nucleotide units. 
5 The process of the present invention produces 

defined sequence oligonucleotides of the following 
generic formula: 



HO 




FORMULA I 

I>Jherein n is 1 to about 199, preferably about 5 to- 
10 about 60, and most preferably about 10 to about 40, R' 
is hydrogen or hydroxy, and B is any of the naturally- 
occurring purine or pyrimidine bases adenine, guanine, 
cytosine, uracil., thymine, or any other naturally- 
occurring base, the nucleotide tinits having naturally- 
15 occiarring bases being independently intermixed with one 
or more nucleotide mits having modified bases (B^) . 
The modified pyrimidine bases (Py^) are substituted at 
the C-5 position, and typical examples thereof are the 
uracil and cytosine bases illustrated by the following 
20 generic formulas: 

( _ OMPI A 

Uk^ wipo 
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NH, 



modified laracil base modified cytosine base 



The modified purine bases (Pu'^) are substituted at the 
C-8 position, and typical examples thererof are the 
modified adenine and guanine bases illustrated by the 
5 following generic formulas: 




modified adenine base modified guanine base 

Ihe substituent group R is characterized by its ability 
to function as one or more reporter groups, or to bind 
one or more reporter groups . In the modified pyrimi- 
10 'dine bases the substituent group R comprises two or 

niore carbon atoms, whereas in the modified purine bases 
R comprises one or more carbon atoms. In this context, 
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R preferably takes the form of one of the following 
ftmctionalized carbon chains : 

R= -CH=CRj^R2, -CH2CHR-LR2, -CHR-,^R2, 

-Cft=CR^- §NHR2 , or -CB<3l^-(i-R2 

5 wherein R-,^ is hydrogen or alkyl; R2 is alkyl, alkenyl, 
aryl, or functionalized alkyl, alkenyl, aryl wherein 
fxmctional groiops include one or more amines , amides , 
nitriles, carboxylic acids and esters, hydroxy, dini- 
trophenyl, aminobenzenesxilf onates , or the like; and Z 

10 is a polyvalent heteroatom such as nitrogen, oxygen 
or sulfur. In addition, R2 may be attached to a solid 
support, or to one or more reporter groups which fmc- 
tion, for example, ds a colorimetric, fluorescent, 
luminescent, radioactive, or ligand recognition group. 

15 Functionally fluorescent grotaps include fluoresceins, 
rhodamines, and the like or adducts thereof; fxmc- 
tionally luminescent grot5)s include lumi.no Is , acri dines , 
lucif erins , dioxetanes, dioxamides , and the like or 
adducts thereof. Ligand recognition groups include 

20 vitamins (such as biotin, iminobiotin or desthiobiotin) , 
antigens such as dinitrophenols , carbohydrates and 
other functional groups or adducts of such groups which 
can be recognized .by ligand- like interactions with 
proteins, or from which such ligand- like interactions 

25 can be elicited. Another oligonucleotide capable of 
interaction with nucleic acids is illustrative of a 
group from which a ligand-like interaction can be 

- . elicited. Ligand recognition groups may also serve as 
functionally colorimetric reporter groups when recog- 

30nition results in dye formation. For example, when 
dinitrophenyl is used as a reporter group, known 
detection systems using an an ti dinitrophenyl antibody 
c^oupled to peroxidase can be used as a detection system, 
resulting in a color change. Fxmctionally radioactive 

35 groups incorporate a radioactive element in the chosen 
reporter group. ^ 

f CMPl 



wo 84/03285 



9 PCT/US84/00279 



-17- 



10 



When reference is made herein to the use of purine 
or pyrimidine bases, such expressions are intended to 
include analogs of such bases. Among such analogs are 
the analogs of pxirine bases, such as the deazaadenosines 
(tubercidins , formycihs ", and the ' like) , and the analogs 
of pyrimidine bases, such as deazatiracil , 4eazacytosine , 
azauracils , azacytosines , and the like. 

Oligonucleotides of Formula I are best prepared by 
chemical synthesis from monomer nucleotide analog units 
of the formula: 



I 

o 

:p-0-R. 



or 



B 



O R' 



FORMULA II 



3' 
I 

o 



0=P.0-R3 

O" 



where : 
Rg = methyl, chloro- 
phenyl 
X = chloro, dialkyl- 
amino, morpholino 



wherein is trityl (triphenylmethyl) , dimethoxytrityl , 
or other appropriate masking group for the S'-hydrox/l; 
B and R' are masked, if appropriate; and (f) represents 

15 a phosphorus -containing group suitable for intemucleo- 
tide bon-d formation during chain extension in synthesis 
of a product oligonucleotide. The phosphorus -containing 
groups (P) suitable for intemucleotide bond formation 
ajre preferably alkyl phosphomonochloridites or alkyl 

20 phdsphomonoamidites . Alternatively phosphate triesters 
may be employed for this purpose. The monomer unit may 
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alternatively have attached at the 3' hydro3cjrl and 
@ attached at the 5'-hydroxyl. 

Generally, the term "masking groiq)" or '^blocking 
groxjp" is a functional expression referring to the 
5 chemical modification or '*b locking" of an integral 
ftmctional group by attachment of a second jnoiety to 
disgxxise the chemical reactivity of the functional 
groxap and prevent it from reacting in an imdesired 
manner. Such modification is reversible, and allows 

10 subsequent conversion back to the original functional 
group by stiitable treatment. In'many cases, such mask- 
ing formally interconverts structural ftinctionality , 
e.g., a primary amine masked by acetylation becomes a 
substituted amide which can be later converted back to 

15 the primary amine by appropriate hydrolysis. 

The compounds of Formula I include the acceptable 
conjugate acid salts thereof. Conjugate acids which 
may be used to prepare such salts are those containing 
•nonreactive cations and include, for example, nitrogen- 

20 containing bases such as ammonium salts, mono-, di-, 

tri- or tetra-substituted amine salts, and the like, or 
suitable. metal salts such as those of sodixjm, potassium, 
and the- like. 

The process steps of the present invention will now 
25 be generally described and illustrated diagrammatically . 
Thereafter, the. invention will be illustrated more speci- 
fically and detailed examples therof provided. ' Since the 
.invention relates to .oligonucleotides incorporating both 
pyrimidine-based and piirine-based nucleotide units, the 
30 use of both pyrimidine and purine-based compoxmds in the 
synthetic process will be illustrated. The specific 
pyrimidine and purine-based compoimds illustrated are 
only exemplary of the respective pyrimidine and purine" 
classes, and it is to be understood that any other member 
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of the respective class can be substituted therefore 
in the process and the product oligonucleotide, when- 
ever suitable or desired. While deoxyribonucleotide 
compounds are shown for the most part, it is under- 
5 stood that ribonucleotide compounds are also con- 
teii5>lated by the invention and can be substituted for 
the deoxyribonucleotide compoimds wherever ribonucleo- 
tide compounds are desired in the product oligonucleo- 
tide. 

10 One of the more important aspects of the invention 

is the provision of a new class of nucleosides which 
are essential as intermediates in the process for syn- 
thesizing the new oligonucleotides. Such nucleosides 
each have a base which is modifie'd by a substituent 

15 group comprising a functionalized carbon chain and one 
or more amides, the nitrogen of the amides being attached 
to a sterically tolerant site on the base through the 
carbon chain. In the case of pyrimidine-based nucleo- 
sides, the carbon chain is attached at the C-5 position, 

20 and in the case of "the purine-based nucleosides, the 
carbon chain is attached at the C-8 position through 
a polyvalent heteroatom, such as nitrogen, oxygen or 
sulfur. In addition, such nucleosides are chemically 
blocked at the 5' position (or the 3' position) with 

25 a group, such as dimethoxytrityl , appropriate for the 
chemical synthesis of oligonucleotides. 
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In the new class of nucleosides the substituent 
group may be" chosen from -CH2CHR^C^H2^Y, -CH=CRjC^H2^Y 

0 a 

-aM3lj^-(i-NHC^H2j^Y, or -CH=CRt CC^H2^Y wherein ,Rj^ is 
hydrogen or Cj^_g lower alkyl and Y is one or more amido, 
5 substituted amido, substituted amino or substituted 
amino alky Iphenyl groups. More specifically, Y may 

include one or more -NHCCX^ wherein X is hydrogen, 

fluorine or chlorine. Synthesis of these nucleosides, 

as well as of the masked forms thereof, is described 

10 hereinafter in Examples I, II, IV, VI, VII, VIII, 

X, XII, and XIII. Preferred nucleosides incorporate 

0 

the substituent group -CH=CHC-NHC^H2^Y at C-5 of 
pjrrimidine nucleosides wherein n = 3 to 12 and Y is 

-NHCCX^- Most preferred are such nucleosides wherein 

15 the pyrimidine base is xiracil. 

The process of the present invention may be 
initiated by the preparation of the selected nucleoside. 
Generally, the most preferred nucleosides are best 
prepared in the following manner. 5- (Methyl 3-acrylyl) 

20 -2'-deoxyuridine is prepared from 2*-deoxyTiridine by 
the method of Bergstrom and Ruth [ J. Amer . Chem. Soc. 
96:1587 (1976)]. The nucleoside is then treated with 
1.05 equivalents of dimethoxytrityl chloride in 
pyridine for 4 hours to block the 5 ' . hydroxyl with 

25 dimethoxytrityl (DMT) . The resulting product is puri- 
fied by silica bhromatography eluting a gradient of 
0-lOZ methanol in chloroform containing 2Z triethy- 
lamine. The purified 5 -DMT-5- (methyl 3-acrylyl)- 
2-' -deoxyuridine is treated with 1 N KOH for 24 hr. 

30 at ambient temperature to hydrolyze the methyl ester. 
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The resulting 5'-DMT-5-(3-acrylyl-)-2'-deoxyuridine 
is treated with excess dicyclohexylcarbodiimide and 
hydroxybenztriazole in pyridine. After 4 hours, a 
2-5 fold excess of 1 ,7-diaininoheptane is 
5 added, and the reaction stirred overnight. -After 
12-20 hours, a 10-20 fold excess of trif luoroacetic 
anhydride is added, and the reaction stirred at room 
temperature for 4 hours. The' product is purified 
by silica chromatography eluting a gradient of 0-10% 

10 methanol in chloroform containing 2% tr ie thy 1 amine , 
followed by exclusion chromatography using Sephadex 
LH-20 eluting 1% triethylamine in methanol. Appro- 
priate fractions are combined to yield 5'DMT-5- 
[ N- ( 7 - tr if luor oace ty laminohep ty 1) - 1 - aery lamido ] - 2 ' - 

15 deoxyuridine; such product is appropriate for oligo- 
nucleotide synthesis by the phosphochloridite proce- 
dure described in Examples XV and XVIII. Alternatively, 
such a compoxmd can be prepared by the combination of 
methods described in Examples II and III. Replacing 

20 diaminoheptane in this process with other diamino- 

alkanes (e.g. , diaminopropane , diaminohexane , diamino- 
dodecane) is productive of other compounds of varying 
< substituent length wherein n = 3, 6, or 12 and 

R = -CH=CHC-NHC NHCCXo- Two such nucleosides, one 
n zn o 

25 pyrimidine (uracil>based and the other purine 

(adenine) -based, are shown at the top of the diagram 
below illustrating the process. Reactive sites on 
the bases of the nucleosides are then masked, as shown 
in Reaction 1, by attachment of, for example, a 

3C benzoyl group (Bz) to the amine at the 



OMPI 
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6 position of the adenine-based nucleoside. Such 
masking is generally described in "Synthetic Pro- 
cedtires in Nucleic Acid Chemistry", Vol. 1, W. Zorbach 
and R. Tipson eds. (Wiley - Interscience, NL. Y,) 
5 1968) . Unprotected amines on the substituent group 
are masked, for example, by attachment thereto of 
trifluoroacetyl groups (Ac), as also shown in Reaction 1. 

The selected 3' or 5* hydroxyl of the nucleoside 
is then masked by attachment thereto of a dimetho- 

10 xytrityl (DMT) group. In Reaction 2 illustrated below, 
the 5 '-hydroxyl is masked, leaving the 3' hydroxyl 
free or available for reaction. Alternatively, the 3' 
hydroxyl could be masked, leaving the 5' hydroxyl free. 
The nucleoside is then converted to' an activated 

15 nucleotide monomer, preferably by attachement to its 3* 
hydroxyl of a phosphorus-containing group which includes 
an activating moiety- When the modified nucleoside is 
properly blocked, modifications of the procedures de- 

20 scribed by Letsinger, et al, Matteucci, et al, or as 
reviewed by Narang, et al can be utilized for oligo- 
nucleotide synthesis. The use of phosphochloridite 
chemistry such as that disclosed by Letsinger, et al, 
is detailed in Examples XVI-X7III. In order to use 

25 phosphoamidite chemistry, a modification of - the proce- 
dure of Matteucci, et al, is used, phosphitylating 
the protected modified nticleoside with methyl chloro 
(N, N-diisopropyl)p.hosphoamidate or methyl chloro- 
phosphomorpholidate, as in the improved procedure of 

30 Dorper, et al [ Nucleic Acids Res . 11:2575(1983)]. Al- 
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ternatively, the protected modified nucleoside can be 
phosphor ylated with 1.2 eq. chlorophenyl dichlorophos- 
phate in trimethylphosphate at room temperature fol- 
lowed by quenching with water to give the 3'-chloro- 
5 phenyl phosphate adduct of the modified nucleoside, such 
adducts being useful in a modification of the phospho- 
triester approach as illustratively reviewed by Narang, 
et al. The diagram illustrates in Reaction 3 the syn- 
thesis of activated monomer nucleotide mits of Formula 

10 II by attachment to the nucleoside 3 ' hydroxyl of a phos- 
phomonochloridite group in which the chlorine functions 
as an activating moiety. 

Coupling or condensation of the selected activated 
nucleotide monomer, i.e. the uracil-based monomer or 

15 the adenine-based monomer, to the ^Terminal unit of a 
growing nucleotide chain is illustrated in Reaction 4 
in the diagram. The nucleotide chain is shown as .in- 
cluding in its right hand end a nucleoside mit having 
a naturally occurring base and having a solid support 

20 or masking group R^ attached to its 3* hydroxyl. The 

illustrated chain also includes one or more (n') nucleo- 
tide units having naturally-occurring bases, said units 
being coupled to the 5* hydroxyl of the nucleoside unit, 
the terminal one of the nucleotide units having a free 
25 hydroxyl at the 5* position. In the coupling reaction 
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